Reinforcement learning - PDFSEARCH.IO - Document Search Engine

Reinforcement learning
Results: 1147

#	Item
901	Hedged learning: Regret-minimization with learning experts Yu-Han Chang [removed] CSAIL, Massachusetts Institute of Technology, 32 Vassar Street, Cambridge, MA[removed]USA Leslie Pack Kaelbling Add to Reading List Source URL: www.machinelearning.org Language: English - Date: 2008-12-01 11:15:56 Game theory Cybernetics Machine learning Search algorithms Learning Reinforcement learning Markov decision process Multi-armed bandit Algorithm Statistics Mathematics Applied mathematics
902	Parameterized Maneuver Learning for Autonomous Helicopter Flight Jie Tang, Arjun Singh, Nimbus Goehausen, and Pieter Abbeel Abstract— Many robotic control tasks involve complex dynamics that are hard to model. Hand-spe Add to Reading List Source URL: www.cs.berkeley.edu Language: English - Date: 2010-06-09 03:17:59 Ballistics Aerodynamics Mechanics Trajectory Aerobatics Reinforcement learning Stall Flight Aerospace engineering Motion
903	Reinforcement learning with Gaussian processes Yaakov Engel Dept. of Computing Science, University of Alberta, Edmonton, Canada Shie Mannor Dept. of Electrical and Computer Engineering, McGill University, Montreal, Cana Add to Reading List Source URL: www.machinelearning.org Language: English - Date: 2008-12-01 11:15:01 Control theory Linear filters Stochastic differential equations Kalman filter Markov decision process Normal distribution Gaussian process Q-learning SARSA Statistics Markov models Stochastic processes
904	Dynamic Analysis of Multiagent Q-learning with Exploration ǫ-greedy Eduardo Rodrigues Gomes Add to Reading List Source URL: www.machinelearning.org Language: English - Date: 2009-05-18 12:17:09 Multi-agent systems Reinforcement learning Q-learning Agent-based model Action selection Affect Machine learning Intelligent agent Artificial intelligence Science Mind
905	Yaser S. Abu-Mostafa is a professor of electrical engineering and computer science at the California Institute of Technology. ARTIFICIAL INTELLIGENCE Add to Reading List Source URL: work.caltech.edu Language: English - Date: 2012-07-11 00:45:11 Learning Netflix Algorithm Supervised learning Recommender system Reinforcement learning Cluster analysis Overfitting Concept learning Statistics Machine learning Artificial intelligence
906	Learning All Optimal Policies with Multiple Criteria Leon Barrett Srini Narayanan 1947 Center St. Ste. 600, Berkeley, CA 94704 Add to Reading List Source URL: www.machinelearning.org Language: English - Date: 2008-05-22 03:19:26 Dynamic programming Markov processes Stochastic control Operations research Mathematical optimization Markov decision process Q-learning Reinforcement learning Convex hull Mathematics Algebra Statistics
907	Table of Contents Preface .................................................................................................................................................................... xiii Organization ........... Add to Reading List Source URL: www.icml2010.org Language: English - Date: 2010-07-25 05:16:19 Supervised learning Cluster analysis Bayesian network Regression analysis Reinforcement learning Hidden Markov model Michael I. Jordan Statistical classification Semi-supervised learning Statistics Machine learning Artificial intelligence
908	Exploration and Apprenticeship Learning in Reinforcement Learning Pieter Abbeel Andrew Y. Ng Computer Science Department, Stanford University Stanford, CA 94305, USA Add to Reading List Source URL: www.machinelearning.org Language: English - Date: 2008-12-01 11:16:12 Estimation theory Statistical theory Dynamic programming Markov decision process Reinforcement learning Markov chain Maximum likelihood XTR Statistics Markov processes Markov models
909	Non-Parametric Policy Gradients: A Unified Treatment of Propositional and Relational Domains Kristian Kersting [removed] Dept. of Knowledge Discovery, Fraunhofer IAIS, Schloss Birlinghoven, 537 Add to Reading List Source URL: www.machinelearning.org Language: English - Date: 2008-05-02 08:01:38 Ensemble learning Operations research Reinforcement learning Gradient boosting Boosting Regression analysis Mathematical optimization Function Supervised learning Machine learning Mathematics Statistics
910	Preconditioned Temporal Difference Learning Hengshuai Yao Zhi-Qiang Liu School of Creative Media, City University of Hong Kong, Hong Kong, China Add to Reading List Source URL: www.machinelearning.org Language: English - Date: 2008-05-23 03:34:40 Preconditioner Statistics Reinforcement learning Iterative method Markov chain Matrix Sparse matrix Applied mathematics Markov models Numerical linear algebra Mathematics

UPDATE